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Abstract 

Background: Head and neck cancer (HNC) belongs to a group of heterogeneous 
disease with distinct patterns of behavior and presentation. TNFRSFIOB, a tumor 
suppressor gene mapped on chromosome 8. Mutation in candidate gene is 
responsible for the loss of chromosome p arm which is frequently observed in head 
and neck tumors. TNFRSFIOB inhibits tumor formation through apoptosis but 
deregulation encourages metastasis, migration and invasion of tumor cell tissues. 

Results: Structural modeling was performed by employing MODELLER (9vl0). A 
suitable template [2ZB9] was retrieved from protein databank with query coverage 
and sequence identity of 84% and 30% respectively. Predicted Model evaluation 
form Rampage revealed 93.2% residues in favoured region, 5.7% in allowed region 
while only 1 residue is in outlier region. ERRAT and ProSA demonstrated 51.85% 
overall quality with a -1.08 Z-score of predicted model. Molecular Evolutionary 
Genetics Analysis (MEGA 5) tool was executed to infer an evolutionary history of 
TNFRSFIOB candidate gene. Orthologs and paralogs [TNFRSFIOA & TNFRSFIOD] 
protein sequences of TNFRSFIOB gene were retrieved for developed ancestral 
relationship. Topology of tree presenting TNFRSFIOA gene considered as outgroup. 
Human and gorilla shared more than 90% similarities with conserved amino acid 
sequence. Virtual screening approach was appliedfor identification of novel inhibitors. 
Library (Mcule) was screened for novel inhibitors and utilized the scrutinized lead 
compounds for protein ligand docking. Screened lead compounds were further 
investigated for molecular docking studies. STRING server was employed to explore 
protein-protein interactions of TNFRSFIOB target protein. TNFSFIO protein showed 
highest 0.999 confidence score and selected protein-protein docking by utilizing 
GRAMM-X server. In-silico docking results revealed 1-58, S-90 and A-62 as most active 
interacting residues of TNFRSFIOB receptor protein with R-130, S-156 and R-130 of 
TNFSFIOB ligand protein. 
(Continued on next page) 
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Conclusion: Current research may provide a backbone for understanding structural 
and functional insights of TNFRSFIOB protein. The designed novel inhibitors and 
predicted interactions might serve to inhibit the disease. Effective in-vitro potent 
ligands are required which will be helpful in future to design a drug to against Head 
and neck cancer disease. There is an urgent need for affective drug designing of head 
and neck cancer and computational tools for examining candidate genes more 
efficiently and accurately are required. 

Keywords: Head and neck cancer, Modeling, Tumor necrosis factor, TNFRSFIOB, 
Docking, MODELLER, Phylogenetic, Virtual screening. Inhibitors, Bioinformatics 



Background 

HNC is the sixth most occurring cancers worldwide [1] whereas in Pakistan, the 
second most prevalent cancer affecting the pharynx, larynx and oral cavity [2]. 
Multigenic nature and environmental agents made heterogeneous and complex epi- 
demiology of disease [3]. Different genetic polymorphisms are reported in enzymes 
involved in the metabolization of alcohol and tobacco greatly increases the risk of 
Squamous Cell Carcinoma of Head and Neck cancer (SCCHN) [4,5]. Morbidity and 
prognosis differ from patient to patient depending on causative agents, anatomical 
site and the stage of disease. 

DNA modifications and structural variations in the genomic content of cell controlling 
gene expression are responsible for cancers. Deletions and duplications of chromosomal 
segments or even whole chromosome lead to the genomic instability causing genetic alter- 
ations [6]. DNA modifications are greatly responsible for change in expression level of 
HNC [6,7]. Genetic events result in the activation of proto-oncogenes and inactivation of 
tumor suppressor genes or both, leading to the development of SCCHN [7,8]. 

Tumor Necrosis Factor (TNF) is a mediator pro-inflammatory cytokine involved in the 
progression and development of cancer. The family of TNF inhibits tumor formation 
through apoptosis but TNFs deregulation encourages metastasis, migration and invasion of 
tumor cell tissues [9]. TNFRSFIOB gene consists of 10 gene coding exons. Sequence ana- 
lysis of all exons suggested allelic loss of 8p was in 20 primary HNCs. A number of putative 
tumors suppressor genes are located on Bp region which is a frequent site of translocations 
in head and neck tumors. In 1998, 2-bp insertion in this gene at residue 1065 was found 
that introduces a premature stop codon lead to truncated protein in SCCHN. Sequence 
comparison between patient and normal tissues confirmed that germ line contains truncat- 
ing mutation in the absence of p53 mutation [10]. 

In-silico analysis of TNFRSFIOB gene was conducted to elucidate the novel mole- 
cules, interacting partners, their binding interactions and to find a most plausible func- 
tions. The main objective of our study was to design novel inhibitors. The aim of 
research was to elucidate the interactions of TNFRSFIOB protein with novel inhibitors 
and to identify the relation of gene with disease. 

Results 

The current work presents bioinformatics analysis of TNFRSFIOB, a candidate gene of 
HNC. TNFRSFIOB gene mapped on chromosome 8, started from 22877646 bp and 
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Table 1 Molecular functions, biological processes and cellular locations of TNFRSF10B gene 



Gene Molecular function Biological process Cellular location 





Apoptotic process 




Receptor activity 


Extrinsic apoptotic signalling pathway via 
death domain receptors. 


Plasma membrane 


Protein binding 




Integral to membrane 


TNFRSFWB 


Regulation of apoptotic process. 




TRAIL binding 


Activation of cysteine-type endopeptidase 
activity involved in the apoptotic process. 





ends with 22926692 bp. Molecular functions, biological processes and cellular locations 
of TNFRSFIOB gene are mentioned in Table 1. 

Protein sequence of TNFRSFIOB in FASTA format was retrieved from UniprotKB 
with accession number E9PBT3. Table 2 represents the templates of TNFRSFIOB pro- 
tein selected on overall quality, total score and query coverage. All the three selected 
templates were used for three dimensional structure predictions by comparative mod- 
eling. The best model was built by MODELLER (9vl0) [11] by using 2ZB9 template 
with optimal alignment. Predicted model was visualized by Chimera 1.6 [12] shown in 
Figure 1. Assessment of predicted structure by Rampage, ERR AT and ProSA is shown 
from Figures 2, 3 and 4 respectively. 

The evolutionary tool MEGA 5 [13] was employed to construct a neighbor- joining 
tree of TNFRSFIOB gene. Ensembl BLAST (http://www.ensembl.org/Multi/blastview) was 
performed to identify paralogs of the target gene. Protein sequences of TNFRSFIOA, 
TNFRSFIOD and TNFRSFIOB were retrieved to determine the evolutionary relationship 
between paralogs and orthologs. Numbers of bootstrap replications were 1000 in bootstrap 
method. P-distance method and complete deletion option were used in the construction of 
neighbor-joining tree shown in Figure 5. 

Virtual screening technique 

Virtual screening approach was employed to identify competitive compounds that in- 
hibit the mutated TNFRSFIOB activity. In pharmaceutical industry, the approach has 
become progressively more popular for lead identification. The main objective of vir- 
tual screening is to screen a large set of compounds against specific receptor protein to 
identify the manageable number of inhibitors for possibly chance of lead to drug candi- 
date [14]. Four lead compounds (A, B, C and D) structures were screened for further 
analysis shown in Figure 6. 

Bioavailability and membrane permeability are the molecular properties that al- 
ways connected with molecular weight, partition coefficient (logP), number of hydrogen 



Table 2 Template aligned by high score and query coverage 



Accession ID 


Total score 


Query coverage 


E-value 


Sequence identity 


2ZB9 


25.8 


84% 


5.4 


30% 


3NKE 


25.8 


46% 


4.1 


32% 


3NKD 


25.0 


46% 


8.1 


32% 
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Figure 1 3D structure of TNFRSFWB protein visualized by the UCSF CHIMERA visualizing tool. A 

model is presented in smootliing ribbons and sticl<s. Helixes are represented by blue colour and coils by 
green colour. 



bond donors and number of H-bond acceptors as a basic molecular descriptors [15]. 
Lipinski "Rule of Five" was formulated by using these molecular properties [16]. 
According to this rule, molecules with good membrane permeability have log P<5, mo- 
lecular weight <500, hydrogen bond acceptors <10 and donors <5 [17]. Therefore, 
Lipinskis Rule of Five was applied to check the bioavailability characteristics such as 



\\f 0 




Goneral/Pre-Pro/Proline Favoured 
Glycine Favoured 



Number of residues in favoured region (-98.0% expected) 
Number of residues in allowed region (-2.0% expected) 
Number of residues in outlier region 



General/Pre-Pro/Proline Allowed 
Glycine Allowed 

82 (93.2%) 
5 (5.7%) 
1 (1.1%) 



Figure 2 RAMPAGE result of TNFRSFWB protein showing the distribution of residues in favored, 
allowed and outlier regions. 
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Residue # (window center) 

Figure 3 ERRAT result of TNFRSFWB model showing the 51.85% overall quality of model. X-axis 
shows the number of resides while the Y-axis represents error values of residues. 

V J 



absorption, distribution, metabolism, elimination (ADME) of the lead compounds. 
In present work, these properties were determined by Mcule tool [18] mentioned 
in Table 3. 

Toxicity 

High quality lead structures are the requirement for the successful drug discov- 
ery and structures of drug properties are more acceptable than common [19]. In 
the early steps of drug discovery, poor pharmacokinetics and toxicity should be 
eradicated. Toxicity and drug score characteristics were further used to screen the 
hits [20]. 

Docking analysis of TNFRSFIOB protein with screened lead compounds was 
performed by AutoDock and post dock analysis by Chimera 1.6v and Discovery 
Studio. The amino acids present in the active sites of the protein were identified 
by observing those amino acids in the vicinity of 4 A. Residues of receptor pro- 
tein interacting with compounds were calculated and presented in Figures 7 and 8 
by Chimera and Discovery Studio respectively. 

A functional partner network of TNFRSFIOB protein was generated by the 
STRING [21] and STITCH3 [22] databases to explore the highly interacting pro- 
teins of the target protein. TNFSFIO protein having highest interaction score 
0.999 with receptor protein was used as a ligand protein in protein-protein 
docking by GRAMM-X [23] and Hex [24]. Interaction network and protein- 
protein docked complex are shown in Figures 9 and 10 respectively. Interactions 
of interacting residues were determined and analyzed by PyMol mentioned in 
Table 4. 
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(A) Overall Model Quality Plot 



100 



— window size 10 

— window size 40 




Sequence position 
(B) Local Quality plot 

Figure 4 ProSA results of TNFRSFWB model (A) Plot showing -1.08 Z-score representing the overall 
quality of the model. (B) Local model quality plot of model showing position of sequences against 
knowledge based energy in both window sizes 10 and 40. 
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L Gorilla_TNFRSF10B 

Chimpnzee_TNFRSF10B 
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100 



Macaque_TNFRSF1 OD 
Gorilla_TNFRSF10D 
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r— Human_TNFRSF10A 
L Gorilla TNFRSF10A 



Figure 5 Neighbor-joining tree of TNFRSFWB gene showing TNFRSFWA gene as ancestral gene. 

Topology of tree showing TNFRSFWA gene is out group of tree. Species having >50% bootstrap values are 
presented in this tree. 



Discussion 

Head and neck cancer remains a disfiguring disease associated with a high mortality 
rate [25]. Progressive accumulation of genetic aberrations leads to SCCHN but exact 
nature is still unknown. Candidate gene identification approach may provide key factors 
to pinpoint candidate genes involved in different carcinomas, which leads to explore 
the receptor-ligand or protein-protein interactions recognize these carcinomas that 
might lead to the development of effective therapeutic strategies [26] . 

For head and neck candidate gene TNFRSFIOB, 2ZB9, 3NKE and 3NKD templates 
were retrieved from PDB. Out of these three templates, 2ZB9 showed optimal align- 
ment and query coverage. Rampage showed the 93.2% residues in favored region 
and 5.7% in the allowing region whereas only 1 residue was in outlier region. 
51.852% quality factor and -1.08 Z-score were shown by ERR AT and ProSA evalu- 
ation tools respectively. 

In literature, TNFRSFIOB gene and its paralogs genes are predicted in primates and 
human. MEGA 5 was employed using neighbor- joining method to determine evolu- 
tionary relationship of genes among teleosts, rodents, birds, primates and mammals. 
TNFRSFIOD and TNFRSFIOA are the paralogs of TNFRSFIOB gene that are used in 
the construction of a phylogenetic tree. TNFRSFIOA gene is outgroup in TNFRSFIOB 
tree that presents TNFRSFIOA gene as an origin of other genes. TNFRSFIOA gene gave 
rise to TNFRSFIOB gene in Ciona Intestinalis and other cluster is further diverged into 
TNFRSFIOD and TNFRSFIOB genes. Human and gorilla are closely related in 
TNFRSFIOB and TNFRSFIOA genes while in TNFRSFIOD, human is closely re- 
lated with chimpanzee. Bootstrap replication values >50 are presented in phylo- 
genetic tree representing the reliability of topology. 

Novel designed molecules have drug related properties and served as inhibitors 
for candidate gene. Interactions were observed in binding pocket of TNFRSFIOB 
showing polar nature of binding domain. The designed compounds fulfill the 
properties of a competent drug and have no toxicity, mutagenic, irritants and car- 
cinogenic property. 
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TNFSFIO protein showed highest interacting score 0.999 with TNFRSFIOB target 
protein belonging to the same protein family. TNFSFIO protein was used as a lig- 
and for protein-protein docking with TNFRSFIOB receptor protein. Post docking 
analysis was performed by PyMol to analyze the hydrogen, ionic and hydrophobic 
interactions. Only ionic interactions were found in docked complex. Isoleucine-58 
of receptor protein TNFRSFIOB showed ionic interactions with Arginine-130 of 
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Table 3 Drug related properties of the designed molecules 



Inhibitors 


Molecular 
mass 


LogP 


Rotatable 
bonds 


H-bond 
donors 


H-bond 
acceptor 


RoF 
violation 


Interacting residues 


A 


130.077 


-0.7977 


0 


2 


4 


0 


ARG-23, GLU-24, ALA-25, ARG-26, 
GLY-27, ALA-28, VAL-39, VAL-41, 
LEU-46 


B 


158.303 


-0.776 


0 


0 


4 


0 


ARG-23, GLU-24, VAL-39, LEU-40, 
VAL-41, ALA-43, LEU-46 


C 


789.86 


3.5317 


14 


5 


14 


2 


ILE-58, ALA-59, SER-60, ALA-62, 
M^-73, ILE-85, GLN-86, TRP-89, 
SER-90 


D 


807.877 


3.65.05 


14 


5 


15 


2 


PRO-9, ALA-10, SER-12, GLY-13, 
LYS-16, ARG-17 PRO-30, GLN-53, 
LYS-54, GLU-57 



ligand protein TNFSFIO with the distance of 2.9 A, The nitrogen atom of arginine 
showed interaction with oxygen atom of isoleucine. Serine-90 of TNFRSFIOB recep- 
tor protein showed ionic interactions with Serine- 156 of Ugand protein with a bond 
distance of 3.2 A. Nitrogen of serine of Ugand protein TNFSFIO interacted with the 
oxygen of serine of receptor protein. Alanine-62 nitrogen of receptor protein 
TNFRSFIOB interacted with arginine-130 oxygen of Ugand protein TNFSFIO with 
3.2 A bond distance. 




C D 

Figure 7 Binding views of lead compounds (A, B, C and D) with TNFRSFIOB receptor protein. The 

compounds are depicted in pinl< sticl<s wliile protein structures in brown lietromeric surfaces. Interacting 
residues of receptor protein docl<ed witli respective four (A, B, C and D) compounds are determined and 
presented by Cliimera. 
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C D 

Figure 8 Post dock analysis of complexes by Discovery Studio tool. Chemical structures of compounds 
with interactions of receptor protein are shown. A) Interactions of novel designed compound A with receptor 
protein. B) Compound B interacted with TNFRSFIOB. C) Interacted complex of TNFRSFIOB with designed C 
inhibitor. D) Binding residues of compound D with receptor protein. 

V J 

Conclusion 

For protein-protein interaction and novel designed molecules, both functional and ex- 
pressional studies of TNFRSFIOB showed significant interaction. In-vivo experimenta- 
tion could be performed in animal model to check the effect of selected protein 
interactions which may leads to the approved drug of head and neck cancer. More than 
80% homology between human and primates are strong evidence to build ancestral re- 
lationship which will help in prediction of protein functions and family. Current re- 
search suggested a baseline for novel ligand screening, docking and ancestral hierarchy 
for development and validation of novel drugs in particular function prediction of can- 
didate gene TNFRSFIOB. 

Materials and methods 

The sequence of TNFRSFIOB protein in FASTA format was retrieved from 
Uniprot Knowledge base (http://www.uniprot.org/) of accession number E9PBT3. 
The retrieved amino acid sequence of TNFRSFIOB was subjected to a protein- 
protein BLAST (BLASTp) search against the Protein Data Bank (PDB) (www.rcsb.org/) 
[27] to recognize a suitable template structure. Suitable template [PDB ID: 2ZB9] having 
84% query coverage, 30% sequence identity and 5.1 E-value was used in comparative 
modeling of TNFRSFIOB protein. The homology modeling program MODELLER 9vl0 
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Table 4 Interacting residues between TNFRSF10B and TNFSF10 proteins 



Receptor 
protein 


Interacting 
protein 


Interactions 
type 


Interactions (Receptor residue Interacting 
protein residue 


Bond distance 
(A) 


TNFRSFWB 


TNFSFIO 


Ionic bonding (N-0) 


ILE-58/0^ ARG-130/NH1 


2.9 








SER-90/O ^ SER-156/N 


3.2 








ALA-62/N ^ ARG-130/O 


3.2 



was applied to generate 3D models. TNFRSFIOB 3D model having lowest objective func- 
tion was further assessed by Rampage [28], ERRAT [29] and ProSA [30] evaluation tools 
for the reliability of predicted structure. 

Molecular Evolutionary Genetic Algorithm (MEGA 5) was used to infer ancestral his- 
tory and species relationship of TNFRSFIOB gene. Distance based approach through 
neighbor-joining technique was applied to construct the phylogenetic tree by using 
1000 bootstrap replicates. 

The compounds obtained through virtual screening were used in docking analysis by 
AutoDock Vina [31]. The ligand molecules of target protein were not reported in earl- 
ier studies and also not found in biological databases, hence virtual screening technique 
was used to screen drug like lead compounds for TNFRSFIOB docking calculations. 
Four novel lead compounds (as A, B, C and D) were screened. Mcule suit was 
employed for virtual screening and to predict the bioactivity and molecular properties 
of lead compounds. 

LogP value is an important predictor of per oral bioavailability of drug mole- 
cules [32]. Therefore, physiochemical properties such as LogP, molecular mass, ro- 
tatable bonds, hydrogen bond acceptor and donors of 4 selected lead compounds 
were determined. Results showed that compounds (C and D) showed violations of 
two rules of lipinski rule of five suggesting that lead compounds have good bio- 
availability. The selected top scrutinized compounds were minimized through 
VegaZZ [33] and ChemDraw Ultra [34]. Subsequent analysis on selected lead 
compounds were carried out and docking analysis was performed to identify the 
binding affinities by AutoDock. Parameters of AutoDock used in docking are 
mentioned in Table 5. 

The STRING and STITCH3 servers were employed to identify the functional part- 
ners of TNFRSFIOB. These databases are online database of known and predicted pro- 
tein interactions including direct (physical) and indirect (functional) relationships. 
Gramm-X and Hex online servers were applied in protein docking of TNFRSFIOB pro- 
tein with its interactive partner TNFSFIO protein. Post docking analysis of docked 
complex was performed by PyMol tool. 



Table 5 Parameters of AutoDock used in docking analysis 



Lead 




Centre 




Size (X-axis* 


Rate of gene 


Rate of 


Binding affinity 


compounds 


X-axis 


Y-axis 


Z-axis 


Y-axis* Z-axis) 


mutation 


crossover 


(Kcal/mol) 


A 


5.52 


61.778 


35.2 


40M0M0 


0.02 


0.8 


-4.8 


B 


5.846 


61.933 


35.423 


40M0M0 


0.02 


0.8 


-4.9 


C 


5.846 


61.933 


35.423 


40M0M0 


0.02 


0.8 


-8.6 


D 


5.846 


61.933 


35.423 


40M0M0 


0.02 


0.8 


-9.9 
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